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negative. The value is derived from the fact that the entire 
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management of all the other resources. Organizations are now 
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same degree of administration and control as is involved in 
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I. INTRODUCTION 


In the early 1950s the first general purpose electronic 
digital computers were introduced. They promised to simpli- 
fy, ease and enhance life in organizations of all kinds. 
Since then, they kept their promises. Their use in all types 
of organizations and commercial enterprises has grown at an 
astounding rate. As a result, it is hard to imagine how 
these enterprises could function today without computers. 

The "Computer Revolution” during the last few years has 
had a tremendous impact on the data and information used by 
enterprises. It resulted in a corresponding "Information 
Revolution”. Using the computer, an enterprise can store and 
retrieve vast amounts of data and information. Being able to 
process data and use information effectively is vital to 
both business and government organizations. The repository 
of data and information is called a "Data Base". It is the 
foundation of the entire computer-based processing system 
for an enterprise. Data and information are very important 
to enterprises. Thus, any improvement in the way data are 
handled is going to improve the ability to manage an enter- 
prise and the quality of services it provides. 

Database Sumo has emerged since 1960s in the data 
and information management field. Data ede dn and data 


independence were the focal points of this technology. This 


orientation of data management increased the productivity 
of the systems, particularly by improving data accessibility 
and reducing data redundancy. Additional thinking about 
organized approaches for the creation and support of data 
and information, emerged a new concept. The concept of data 
as an important corporate resource. Thus, data and informa- 
tion resource management have become today the current cree 
and focal points of data processing. 

Data Dictionary/Directory Systems  (DD/DS) made their 
debut in the mid-1970s with the management of database 
definitions as a central goal. They protected the integrity 
of definitions and also accurately documented the data 
resources. What started as a tool for description and docu- 
mentation of data within a database, soon became a service 
for data resource control. Thus, the DD/DS became a primary. 
administration tool, supporting in a comprehensive manner, 
the logical centralization of data resources. Both in theory 
and in practice, a DD/DS in its implemented form can support 
a manual system as well as a highly sophisticated automated 
one. It may also exist as an independent system as well as a 
depended one on a specific database management system. 

This thesis will explore and describe the role of the 
DD/DS in the systems development life cycle (SDLC). To bet- 
ter comprehend this role, key concepts of today’s complex 
data processing environment will be discussed. Included in 


this discussion is data and information concepts since their 
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early stages, database concept, system development life 
cycle concepts, and information resource management concept. 
A key component in the SDLC is the DD/DS. lts usefulness and 
in particular the benefits that can be derived from its use 
are explored and emphasized. The final discussion will be on 
a survey of the currently available DD/DS packages, offered 


by vendors, and their problems. 
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il. PRINCIPLES AND CONCEPTS 


A. INFORMATION 

A foundation for a modern enterprise is the information 
needed for its survival and prosperity. It is a resource pa- 
rallel in importance to labor, land and capital. Information 
may be defined as: 

data that have been processed into a form that it is 
meaningful to the recipient or user and is of real or 
perceived value in current or prospective decision 
processes. [Ref. 1] 

Information is a term which must be distinguished from 
the term data. Data are facts, records of an event that has 
occured or is about to take place. They undergo a trans- 
formation involving infusion with intelligence and they 
become information. Thus, although information originates 
from data it differs from them because all data may not 
become information. Also, what is information for one person 
may not be for another. Information adds to relevant  know- 
ledge, reduces uncertainty, and supports the decision-making 
process in an organization. 

Senn . [Ref. 2] further describes the general attributes 
of information, both for an item of information and for a 
set of information (Fig. 1). 

The information used in managing an enterprise, whether 


applied in a communication sense or in a decision-making 


context primarily comes from two sources: Internal and 
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Attributes of an Item of Information 





Accuracy Information is true or false accurate or in- 
accurate. 
Form It is the actual structure of information. It 


includes the dimensions of quantifiability, 

level of aggregation and medium of presenta- 
tion. Often a selection of one or the other 
alternate forms is’ dictated by the situation. 


Frequency Is a measure of how often information is 
collected, needed or produced. 


Breadth It defines its scope. Some information may 
be broad in scope, covering a large area of 
interest. Other information may be narrow in 


its scope. Usage determines the necessary 
breadth. ; 
Origin It is the source from which it is received, 


gathered, or produced. 


Time Horizon Information may be oriented toward the past, 
toward current events, or toward future 
activities and events. 


Attributes of a Set of Information 


Relevance Information is relevant if it is needed for 
a particular situation. Information needed 
at one time may not be relevant now. Like- 
wise information obtained "just in case it 
is needed" is not relevant. 


Completeness Complete information provides the user with 
all that needed to know about a particular 
situation. 


Timeliness Timely information is information that is 


available when it is needed. Further, it has 
not become out-dated through delay. 


-— 


Figure. 1--The Attributes of Information (Ref. 2) 


ES 


external. Identifying these sources and being able to 
evaluate the reliability of each is an important task for a 
manager. The manner with which this task is accomplished 
affects the value of the information and the purpose for 
which it will be used. 

Information processing as decision-oriented data proces- 
sing is aimed at ako data into information needed 
by managers. Today this transformation commonly accomplished 
via computers, which with computer programs, carry out the 


calculations, manipulations and data reduction. 


B. INFORMATION SYSTEMS 

Information used in managing an enterprise can be cate- 
gorized as belonging to three levels of hierarchy (Ref. 31] 
as illustrated in Fig. 2. 

On the first level there is the repetitive, predictable, 
routine, frequently produced, and frequently accessed infor- 
mation. This information is used for day-to-day operation by 
the first-level management, such as the accounting, payroll 
and credit departments and is known as operational informa- 
tion. This basic needed information is the most amenable to 
automation in an enterprise. 

On the second level there is the functional information. 
At this level similar information types are grouped together 
into functional units. This provides operating mangge 
with a degree of control and a broader scope of information 
about the business operation. 
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At the third level there is the executive-level informa- 
tion. It is designed to provide management with information 
supporting enterprise-wide policy-making activities and high 


level planning. This information is characterized by a high 


Special request Decision support 
< 
Enterprise 
wide management 
planning policy. 


Information Demand 
request Operating management reports 


functional information < 
grouping for control 


Transaction Schedule 
reports 





Figure 2.--The Information System Hierarchy (Ref. 3] 
degree of summarization. Automation at this level enables 
sophisticated manipulation of available information, higher 
degrees of integration, and more complex analytical reports 
on a more timely basis. 7 


Due to the needs for information on a continuing basis, 


it is necessary to develop a subsystem for processing and 
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handling the information resource alone. An Information 
System may be defined as: 
a system which uses personnel, operating procedures, and 
data processing subsystems to collect and process data 
and disseminate information in an organization [Ref. 4]. 


Hardware, software, data, personnel and procedures are 


the basic elements of an information system (Fig. 3). 


Information Processing 


Hardware Software 


Personnel 


Procedures 





Figure 3--Elements of an Information System 
Information systems have evolved from manual systems to 
automated systems; database systems; on-line systems; to 
user-friendly, and highly sophisticated systems of today's. 
Each stage of evolution involved a transition brought about 
a changing environment in response to problems. Each stage 
solved some problems and reduced personnel requirements, but 


opened the door to a whole new set of problems. Each stage 


16 


shortened or broadened the information cycle, but opened up 
possibilities and opportunities not feasible previously. En 
a few years when it is expected that the fifth generation 
computer systems will be in wide use, information processing 
Systems will be the central tools in all areas of social 
activity. They will include economics, industry, art and 
science, administration, international relations, education, 
culture and daily life. Such information systems will be re- 
quired to meet the new needs generated by environmental 
changes. They will not only be expected to play active roles 
in the resolving of anticipated social bottlenecks but also 
to advance society along a more desirable path through the 


effective utilization of their advanced capabilities. 


C. FILE ENVIRONMENT 

The success of an information system depends on proper 
management of data. Even if the hardware and software ele- 
ments are well designed and operating properly, the SS of 
the system will be marginal if the underlying data are not 
reliable. Similarly, if the application programs that need 
the data cannot get at them, the system may be limited in 
usefulness. 

When organizations first introduced computers, they 
developed the so called file systems. A' file system whether 
On cards, tapes or disks may be defined as: | 

designed to receive, store and produce specific infor- 


mation/output that requires specific input. A file 
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system is a collection of individual files of organized 
records that serves the current information needs of 
specific application (Ref. 35]. 

In a file system the smallest element in the hierarchy 
of data is the data item. The data item is a fact or state- 
ment about an entity (person, place, thing, or event). Bor 
the data items to be distinguished from each other, should 
have clearly identified characteristics of name, size and 
type specification. Related data items are then grouped 
together to form records, which in turn are grouped into 
files. Data records may be fixed or variable, whether the 
number and position of data items in the record are fixed or 
variable in length. When finally the file is established and 
its definition is decided, it becomes fixed and all records 
must conform to the definition. Data items and records that 
form a file in a student situation, are shown in Fig. 4-7. 

In the file oriented environment two types of files were 
commonly developed and used. The first, Master files, hold 
historical overview of past events, and also show the status 


Of items of current interest. The second, Transaction files, 


capture data about occuring events, and are used to update 
the Master files. Into these files the data are stored se- 
quentially or randomly, and the relation between items and 


records is physical. 
As organizations grow and gain experience with data pro- 
cessing they discovered the major drawbacks of the tile 


environment. They had been engaged to use files for only one 
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Entity : Student 


Data Item Name Size Type 


Student Name X(alphanumeric) Tim Walker.. 


Identification 9(numeric) 
Number 





Figure 4.--Relationship between an Entity and two 
Data Items 


Record : Student ( a 60-character record ) 


Data Item Name Size Type Value 


Student Name Tim Walker 


Identification 
Number 22444 


Address 311 Omaha St. 


City Monterey 


State CA 


Zip Code 





Figure 5.--An Instance of the Record Student 
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Record Student Address State Zip 
Name Code 


Tim Walker Omaha St. Monterey 


John Nolan River St. Monterey 


Bill Perry Lake St. Monterey 


Nancy Norm Demon St. Monterey 





Figure 6.--A Student File (collection of records) 


Data Item 1 Data Item 2 Data Item n 


Record 1 Record 1 Record 1 


Record 1 Record 2 Record m 


File of Data 





Figure 7.--Data Hierarchy 
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or two applications, and they realized there was a limited 
amount of data sharing across applications. Thus, they were 
forced to have data and file redundancy. Later, data and 
file duplication led them to problems of data integrity and 
most of the times to problems of data inconsistency and 
inaccessability. Meanwhile, the cost of labor was increasing 
steadily, while the cost of computers was decreasing drama- 
tically. Now, there was the potential for trading people 
resources for machine resources. For these reasons, and also 
to solve the kinds of problems that the file environment 


created, database systems were developed. 


D. DATABASE ENVIRONMENT 

The basic difference between a conventional file or ap- 
plication system and a database system is in the philosophy. 
A database is a collection of related data about an enter- 
prise that can be processed aS an integrated whole. The 
database does not aoe information as information, but acts 
stores data to be used in generating multiple levels and 
types of information. In a database the data definitions and 
the relationships between data are distinguished from the 
procedural statements of a program. The database is a self- 
describing collection of integrated files (Ref. 61], because 
it contains within itself, a description of its structure. 


To understand and appreciate these concepts, consider the 


systems shown in Fig. 8. In the file processing system each 


Zi 


Faculty 


Data Reports 


File 





Class 
Scheduling |————» Reports 


System 





Student 
Data Reports 


File 





di 


Three File Processing (Pre-Database) Systems 


Definition —» |BReports 





Faculty 
Data 






«—»| DBMS €—» |Scheduling| —^?lReports 


Student 
Data 


—> | Reports 





LLL, 





A Database Processing System 


Figure 8. --Difíerence between a File and a Database System 


file exists independently. To obtain combined information, a 
new program must be written extracting data from correspond- 
ing files. Instead, in a database system the files have been 
integrated into a database, that is, processed independently 
from the application programs. An intermediate system is 
also used called database management system (DBMS) which is 
a large and complex program. The DBMS stores data and also a 
description of the format of data and n called upon by the 
programs to access the database. 

The database approach offers a lot of advantages. These 
advantages and the corresponding disadvantages of database 
processing can be seen in Fig. 9. Nevertheless, even with 
these considerable disadvantages, the advantages of using 
database technology make it extremely desirable in today's 
data processing environment. 

Án organization database system has five components: (1) 
Hardware, (2) application and utility programs, (3) data, 
(4) people, and (5) procedures. People for a database system 
are considered the users, Operations personnel, systems 
development personnel and the database administration staff 
(DBA). DBA serves primarily as a protector of the database, 
resolving user's conflicts. 


To design a database, one requires conceptual represen- 


tations of the real world. The ~- database design can be 
characterized as a scientific, intuitive, artistic and 
iterative process. It is divided into two phases: logical 
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Advantages of Database Processing 





. More information can be produced from the same 
amount of data 


. Consistent information can be supplied for the 
decision making process 


. Elimination or reduction of data duplication 

. Program/data independence 

. Application programs can be developed, maintai- 
ned, and enhanced faster and more economically 
with fewer skilled personnel 

. Better data management 


. Security controls can be applied 


. Representation of record relationships 


Disadvantages of Database Processing 





. Expensive, due to DBMS more hardware and high- 
er operating costs 


. Complex 
. Backup and recovery difficult 


. Integration of data, and hence centralization, 
increases vulnerability to failure 


Figure 9.--Advantages and Disadvantages of Database 
Processing 
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design and physical design. A logical database design 
specifies the logical format of the database. The records to 
be maintained, their contents, and relationships among those 
records are specified. The logical design is usually called 
conceptual schema, or logical schema [Ref. 71]. The physical 
design is the phase of transforming the logical schema into 
the particular data constructs that are available with the 
DBMS to be used. 

Database design should satisfy today's anticipated as 
well as future needs for information. It should be easy to 
modify, and expandable. Before inserting any data in the da- 
tabase, the data should be examined for validity and remain 
correct once is stored. Finally, it should provide security 
and privacy facilities to different users. Only authorized 
users should have access to the data stored in the database. 

Á database provides three views of data: (1) schema or 
(conceptual view), (2) subschema or (external/user) view, 
and (3) internal (physical view). The schema is the complete 
logical view of the data and describes all the data in the 
database. The subschema defines a subset of the schema which 
is accessed by a specific application program or user. Sub 
schemas can overlap each other and can also reorganize the 
schema, depending on the DBMS used. The physical view is the 
form of the data as it is arranged “and allocated to files. 
All these views must be defined before the database  proces- 


Sing occur. Usually the DBA defines the schema and 
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subschema. When the database is defined , the physical view 
is created automatically by the DBMS. 

Having multiple views of data means that even though the 
data is centralized and shared, it can be tailored to the 
needs of the application. Then data can appear to each user 


in a more familiar and useful format. 


E. SYSTEMS DEVECOPMENT LIFE CCEE 
When information systems are to be installed into orga- 
nizations, they must be the product of careful planning. 
Systems developers must have in mind, that they have to 
provide the right system and also that the system must work 
right; From the time the need for an information system is 
first perceived to the time that the system is actually 
delivered, there are many stages that take EP. These 
stages comprise the Information System Development Life 

Cycle (SDLC) and commonly are: 
CEIS Initiation phase or system planning phase. 

This stage deals with formulation of a master plan 
of the system, (i.e. perception of the need, clari- 


fication of purpose, technical, economical, and 
operational feasibility). 


(2907 Requirements definition and analysis phase. 
The system is first partitioned into subsystems in 
order each part can be studied effectively. Then 


user information needs are determined and described, 
and detailed system requirements are established. 


C335 Logical system design phase. 
The functional specifications of the system are 
formulated, ( specification of procedures, input/ 


output, files and databases). 


(4). Physical system implementation phase. 
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(5). 


(6). 


(7). 


During this stage programs are coded and construc- 
ted. Record formats, data structures, files and 
databases are developed, and specific hardware devi- 
ces are selected. 


Testing phase. 

The implementation of programs, procedures, must 
be tested to ensure that the system is running 
properly. 


Implementation and evaluation phase. 
During this stage users are trained, and  evalua- 
tions are made of the system and of how it operates. 


Maintenance phase. 
The SDLC is not completed after the installation 


and implementation phase. Improvements must be made 
continually to correct errors, in order to meet new 
management needs, or to take advantage of new 


technological improvements. 


The phases of the SDLC are fairly universal and are 


accepted by management and the data processing community in 


general. 


However, in this high level categorization, there 


is no clear beginning and end for each phase. 


No system can be effective if it does not meet manage- 


ment and, especially, user needs. Guidelines that analysts 


and designers must follow to avoid pitfalls and problems 


during the SDLC can be summarized as: 


en. 


e). 


The development of an information system should be 
tied to overall organization goals and objectives, 
whether these are profit maximization, cost minimi- 
zation, or growth. 


Useful approaches for system development are the 
top-down and bottom-up approaches or a combination 
of both. The top-down approach is more effective to 
generate the general scope" on how the system will 
evolve to support organization goals and objectives. 
The bottom-up approach may be followed within the 
overall development process. 
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( 3) Before a systems request is approved and included in 
the master plan, assurance is needed that it can be 
accomplished within reasonable technical, economical 
and operational constraints. 


(4). The determination and specification of user require- 
ments must be accurate and complete, otherwise the 
new system has a great potential of failure. 


CS. The system should be designed in the most cost 
effective and efficient manner. The design should 
provide efficiency, accuracy, and flexibility. The 
logical design requires a knowledge of the hardware 
and equipment that will be available for the project 
as well as, insight into potential users and pur- 
poses of the output. Users and systems developers 
alike should understand how the system was created 
and the techniques used to design it. 


(62): Software development which is time-consuming, costly 
and an error-prone process must be iterative with 
feed-back from each stage to the previous one. 


CZ: Evaluating the new system is 3a critical step in 
learning how the system is operating and where chan- 
ges must be made. Evaluating the system impacts 


involves looking at performance and at the effect 
of the applications within the information system. 


(8). Finally the operation and maintenance stage of the 
SDLC counts for the major portion of the total cost. 
It is also an iterative process where system persons 
must cycle back and forth, acquiring additional in- 
formation about design questions as the needs arise 
or as the problems change. 


RE INFORMATION RESOURCE MANAGEMENT 


T Definition 





Modern organizations are becoming increasingly comp- 
lex, because of size growth, increased specialization, 
higher technology levels in products and processes, and the 


changing structure due to internal and external pressures. 


The proliferation of computers, and the introduction of 
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automated information systems into management has been a 
major development providing a new tool for improving opera- 
tions and planning. On the other hand, it has created an 
increasing need to manage these systems more effectively and 
efficiently. Resources into an organization are managed to 
optimize their utilization. To manage resources means to 
plan for, allocate, maintain and conserve, prudently exploit 
effectively employ, and ince those resources. But 
primarily it is necessary to understand the resource, to 
know when and what is available, its source and also its 
destination. The Information Resource Management concept 
(IRM), was the result from the recognition for the need of 
managing information like any other resource in the organi- 
zation. IRM may be defined as : 
whatever policy, action/procedure concerning information 
(both automated and non automated), which management 
establishes that serve the overall current and future 
needs of the enterprise. Such policies, would include 
consideration of availability, timeliness, accuracy, 
integrity, privacy, security, auditability, ownership, 
use and cost-effectiveness (Ref. 81. 

Information required to manage the organization  re- 
sources is derived from data. Thus, data is a resource 
that must be understood, conserved, exploited, employed and 
integrated. IRM is a new concept, which shifts the design 
of traditional processing systems around the data to be 
processed, as the central core. The recognition of the IRM 


concept, and the awareness of managers that data is an 


Organizational resource, has led them to establish a set of 


29 


management procedures and technical functions, which called 
"database administration". 
2: Database Administration 

Database Administration (DBA) has been staffed from 
the most of organizations to protect the database, while at 
the same time to maximize benefits for the users. It is the 
agency for centralization and exercise of control over the 
database. Database nea may be a person, or a group 
of persons for large,complex, and widely shared databases. It 
involves several specialties. It includes information system 
analysts, data structure and data organization designers, 
security officers, recovery specialists, auditors, and ac- 
countants. DBA encompasses all the technical and management 
activities required for organizing, maintaining and direct- 
ing the database environment, or otherwise, all the data of 
the organization which is organized and controlled using a 
database technology [Ref. 9]. The DBA negotiates with the 
users; the application analysts, for the storage of data re- 
quired by the applications; for the permissible use of that 
stored data; and for the cost/benefit economics and priori- 
ties for that stored data. The DBA, the security officers, 
and the database system maintainers, use the various data 
descriptive and control languages, and utility programs, to 
enter policies, procedures, and controls into the database 
system. The DBA should be the only individual in the organi- 


zation concerned with the computer’s model of the data. 
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The database environment is a combination of hard- 
ware and software that supports the user's interface and the 
E tsbase administrator's interface. It consists of the data- 
base, the database administrator, the software tools used in 
data administration and data processing, and the users. Into 
this environment the DBA functions as a leader in planning, 
design, development, implementation, testing, documentation, 
Operation, and maintenance of the entire database. The 
functions performed by the DBA and the corresponding respon- 
sibilities can be categorized into three general areas: e 
management of the data activity, (2) management of the data- 
base structure, and (3) the management of the DBMS itself. 
These functions and responsibilities include: database defi- 
nition/redefinition; selection and procurement of hardware/ 
software services; database design/redesign; database crea- 
tion activities; database security and integrity functions; 
database maintenance and management; performance monitoring 
and evaluation; database enforcement activities; liaison 
activities with users/systems and application analysts; and 
training of users. A DBA's staff configuration and its 
its responsibilities is given in Fig. 10. The members have 
various duties in different stages of the database develop- 
ment. The size of this configuration depends on the size of 
the enterprise, the complexity of the applications to be run 
with the database, the complexity of the chosen, the level 


of the user's sophistication and the phase of the project. 


on 


2 persons: Expert in communicating 


Operations with the operations staff, pos- 
sibly previous operations staff 
Expert members. 


4 persons: 1 person software exper- 


Systems tise major, and hardware minor. 
1 person hardware expertise major 
Expert and software expertise minor. 





2 persons experts in DBMS. 


4 persons: 2 persons expert in ap- 
Applications plications, ( possibly previous 
systems analysts from the appli- 
Expert cation group ).2 persons experts 
in programming applications 
(possibly previous application 
DBA programmers ). 


2 persons: experts in maintaining 
data dictionary, security defi- 
Librarian nitions, previous experience in 
user community necessary, arti- 
culate in native language. 


2 persons: experts in formatting 


Query screens for on-line transcactions 
Language in implementing security, and in 
Expert authorizations for on-line 


transcactions, experts in query 
languages, previous experience in 
user community. 


1 person: expert in locating defi- 
ciencies in usage of database, in 
Auditor regard to security, privacy, and 
charging mechanism, on the part 
of DP operations and application 
programmers; expertise in ac- 
counting, preferably from the 
user community with some applica- 
tion programming background. 


Figure 10.--DBA's Staff at Full Capacity and its 
Responsibilities (Ref. 10) 
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J. Database Administration Tools 
a. Database Management Systems 

A DBMS is a software tool that provides an inte- 
grated source of data for multiple users, while presenting 
different views of the data to different users. It can be 
considered as generalized software which provides a single 
flexible facility for accomodating different data files and 
operations, while demanding less programming effort than 
conventional programming languages. It features easy access 
to the data, facilities for storage and maintenance of large 
volumes of data, and most importantly, the capability for 
sharing the data resources among different types of users. 

DBMS products vary in the degree to which they 
provide their functions (Fig. 11). Currently, no commercial 
DBMS provides all these functions entirely satisfactory. 
These functions are necessary and important however, and 
this situation should change as DBMS products evolve and as 
new products are developed. 

Although most database experts agree that these 
nine functions are required, they do not agree how some of 
these functions should be performed. Some people believe 
that these functions should be provided by the DBMS automa- 
tically. Other believe that some of them should be performed 
by application programs or by users. In the entire database 
development cycle, probably nothing is more important than 


the DBMS evaluation and selection. It is true that DBMS 
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. Store, retrieve, and update data 


. Provide integrity services to enforce database 
constraints 


. Provide a user-accessible catalog of data descriptions 
. Control concurrent processing 

. Support logical transcactions 

. Recover from failure 

. Provide security facilities 

. Interface with communications control programs 


. Provide utility services 


Figure 11.--Functions of a DBMS 

evaluation is meaningful only after DP managers or DBA have 
obtained a clear and concise picture about what kind of DBMS 
will be most beneficial to their particular organization. An 
intelligence selection of a DBMS can almost guarantee a suc- 
cessful implementation. As many other software evaluations, 
the selection of a DBMS should include the following items 
in a checklist with proper weighting factors: Vendor service 
capability; technical report; personnel training facility; 
software interface compatibility; hardware requirements; 
documentation; and security/recovery facility. 

The outputs of the logical database design, the 
system requirements, and the preliminary design of programs, 


are the inputs to the physical database design. However, 


34 


since the specific outputs vary from one DBMS to another, we 
must have an insight of the different database models 
evolved since their earlier stages. 

A database model is a vocabulary for describing 
the structure and processing of a database. Database models 
can be used for both logical and physical database design, 
and used to categorize DBMS products (Ref. 111]. The database 
models have two major components. The data definition lang- 
uage (DDL), which is a vocabulary for defining the structure 
of the database, and the data manipulation language (DML) 
which is a vocabulary for describing the processing of the 
database. DML are distinguished into procedural DML and non- 
procedural. Procedural DML is a language for describing 
actions to be performed on the database. Nonprocedural DML 
is a language for describing the data that is wanted without 
describing how to obtain it. 

The earliest DBMS were developed in the 1960s, 
based on hierarchical, network, and inverted-tree data 
models. Additional improvements led to the currently exist- 
ing models portrayed in Fig. 12. 

Six common and useful database models are 
portrayed. The models are arranged having an orientation tor 
humans and human meaning, to machines and machine specitica- 
tions. Their characteristics and application is also shown 
Mig. 13. As we can see only three of these models are 


actually DBMS products. 
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Human (logical) Machine (physical) 
< » 


Semantic Entity Relational  CODASYL  DBMS- 


Data Relational Data DBTG Specific 


Model Model (E-R) Model Model Model 


ANSI / X3 7 SPARC 





Figure 12--Current DBMS Models 
DBMS has significant features and offers advan- 
tages in reducing the redundancy of stored data, avoiding 
incosistencies in stored data, allowing for the sharing of 
stored data, maintaining data integrity, enforcing standards 
and providing security over data. Nevertheless, DBMS is one 
thing, and adopting a database approach is another. DBMS 
does not necessarily resolve all the problems related to the 
DBA within an enterprise. In general the hazards of using 
the DBMS are political and economical (Ref. 12] and can be 
grouped as: 
Political: Will top management support the creation of 
database covering the entire organization structure and 
insist on organization-wide cooperation ? 
Legal: Legal complications can sometimes arise when two 
or more separate files are “joined together into an 
integrated database. 
Personnel: How many people of what skills must be hired 


for effective use of the DBMS. 
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Type Characteristics Application 
Semantic DDL language for storing meaning. Logical 
Data High level DML database 
Model No DBMS based on this model. dasign 
Entity- Entities and relationships modeled Logical and 
Relatio- as data. physical 
ship E-R diagrams graphically show database 
Model relationships. design 

No DML 
Relational Data represented as tables. Logical and 
Model Relationships implied by data. physical 

Dynamic data relationships. database 

Procedural and nonprocedural DML. design 

A few DBMS based on this model. 
CODASYL Oldest data model. Physical 
DBTG Relationships must be predefined. database 
Model Procedural DML. design 

Extensive application in industry. 

Many DBMS based on this model. 
DBMS- Models vary widely. Physical 
Specific DDL and DML closely conform database 
Model to features of the DBMS. design 
ANSI/X3/ DBMS model instead of database Design 
SPARC model. Three schema model. model of 
Data Can support a variety of DBMS 
Model different data models. 

Figure 13. --Characteristics and Application 


of Data Models 
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. Training : What training courses must be provided to 
prepare personnel for database development and use ? 
Training of user personnel must be given as much consi- 
deration as training of the developers. 


. System software : What other software packages must be 
purchased for effective use of the DBMS. 


. Application software : All database implementations re- 
quire the development of appiication software. Software 
aids provided by the DBMS to reduce the skill level 
and time required to develop application software is 
important to the overall cost of DBMS use. 

. Hardware : What additional hardware must be added to 
make effective use of the DBMS. 

b. Data Dictionary / Directory. Systems 

A successful enterprise is the mirror of an ef- 
fective management. And a vigorous management is cognizant 
of the value of corporate resources, and optimizes and safe- 
guards them accordingly. Until quite resently only a small 
percentage of enterprises exercised as tight a control over 
their data resources as over their other resources. And as 
data resources continue to increase, it is becoming apparent 
that the DBMS is not the panacea to all the problems about 
the management of data, particularly since not all the data 
is automated. DBMSs encouraged centralization of perspective 
on how data should be organized and used. A recent trend is 
to use anew automated tool which encourages centralized 
perspective on how information about these data should be 
organized and used. This tool ais ~ called Data Dictionary/ 


Directory System  (DD/DS). The DD/DS performs some of the 


same functions as the DBMS, and provides benefits parallel 
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to those attributed to the use of a DBMS. However, it has 
begun to be recognized as the more general of the two, since 
the benefits derived from its use are related to the total 
data resources of an enterprise. It provides the central 
control of data resources in a uniform manner across organi- 
zation lines. It pertains not only to database systems but 
increasingly to non database systems as well, and it is 
further broadened by its support for job streams, structured 
Systems, on-line environments and system design. It may be 
considered as the primary software tool for a DA, providing 
the possibility in an enterprise with no need for a database 


administrator (DBA). 


Ss Data management Software Tools 
There are a great number of commercial software 


data management products available today, which can assist 


the DA, or the DBA in an enterprise. These products can be 
classified into four general categories. Those based on 


physical linkage between files, those based on inversion of 
data values, the decision support systems (DSSs), and the 
file-pass data manipulation systems (DMSs). The characteris- 
tics of these management software tools, and the current 


Offerings by vendors are shown in Fig. 14-15. 


Ross (Ref. 131], categorizes these four types oft 
Management systems, into three conceptual classes : Designer 
packages, self-contained packages, and implementation and 
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(continued) 
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Figure 15--Classification of Commercially Available 
Mainframe Data Management Software (Ref. 13] 
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access tools. The examination of the various system types 
reveals that the physically linked DBMSs are highly complex, 
but the benefit of their use is a high degree of performance 
tunability. These packages are commonly oriented toward DP 
professionals, like the application designers. The inverted 
DBMSs and the DSSs, are much less complex, more flexible in 
meeting application requirements and can be characterized as 
self-contained systems. Finally, the file-pass DMSs, rely on 
some other system component (an access method, such as ISAM) 
.or DMSs, facilitate the task of data delivery by circum- 
venting the need for application programming, and labaled as 


tools for program implementation and data access. 


G: SUMMARY 

The introduction of the computer technology, and the 
Mi teration. of its usage into organizations, led them at 
ealier stages to believe that all their problems could be 
solved by machines. Later, they realized that most of the 
problems were not technical, but rather administrative and 
procedural. Thus, many concepts such as, database, IRM, were 
developed and implemented for more efficient operation and 
control. 

Data has been an important element in the operation of 
an organization throughout history. Nevertheless, it is not 
until recent years, with the explosive A of 
computers, that the value of data as a resource has been 


fully recognized. 
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The human function responsible for the proper  admini- 
stration, control, and coordination of all data-related 
activities into an organization is the DA. À primary soft- 
ware tool, an automated facility, that supports the data 
administration function in managing data as a resource is 


the Data Dictionary/Directory System. 
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TII. DATA DICTIONARY/DIRECTORY SYSTEMS 


A. DEFINITIONS AND BASIC PRINCIPLES 
There is a wide range of definitions for Data Dictionary 
/Directory Systems (DD/DS). Due to the increasing interest 
and the rapidly evolving nature of this field in recent 
years, terminology is somewhat confusing. One author speaks 
of a Data Dictionary System (DDS) or System Resources 
Dictionary, while another refers to DD/DS or Data Element 
Dictionary/Directory System, (DED/DS). Characteristic defi- 
nitions include those of: 
Leong-Hong and Marron (1977): 
the DED/DS is a software tool that provides the means 
for defining and describing the characteristics of a 
database, as opposed to the contents of a database [Ref. 
141; / 
National Bureau of Standards (NBS) (1978): 
the DED/DS is considered as a resource manager. It is an 
integrated repository that provides data necessary for 
managing data. Data management includes the planning, 
control, direction, and organization of data (Ref. 15]; 


Cardenas (1979): 


the DDS is a centralized repository of data about data 
(Ref. 161; 


Leong-Hong and Plagman (1982): 
the DD/DS is a system that is designed to support com- 
prehensively the logical centralization of data, about 
data (metadata), (Ref. 171; à 


Further, they elucidate the difference between the two 


types of metadata, "dictionary" and "directory". Dictionary 
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metadata provides information which describes what the data 
is what it means, and what exists. Directory metadata desc- 
ribes where the data is located, and how it can be accessed. 
Van Duyn (1982): 
the DD/DS is a collection of data elements, structure 
informational entities, characteristics, and locations 
of data, (Ref. 18]. 
He also, proposes that the current concept of a data 


dictionary system is composed of the following: 


Data Catalog: a structured listing of data elements, 
with or without a description of the listed elements. 


Data Dictionary: an organized compilation of data ele- 
ments, data attributes, structure, and characteristics. 


Data Directory: an orderly listing of data elements 
names, identifiers, locations and physical characteri- 
stics of these data. 


Data Dictionary/Directory: which combines the features 
of a data dictionary and data directory. 


Allen, Loomis and Mannino (1982): 

a DD/DS is an automated information system. It helps to 
achieve control of the data resource, by providing an 
inventory of that resource. It helps to control the cost 
of developing and maintaining applications. Finally ie 
can provide for independence of metadata across comput- 
ing environments, improving resiliency to the effects of 
hardware and software changes, (Ref. 19]. 

The variety of definitions provides some idea of the 
evolving scope and increasing complexity of DD/DS. The terms 
"data dictionary", "system resources dictionary", "directory 
of data definitions", appear in common usage. The DD/DS does 


not contain only definitions for data, nor it is merely a 


dictionary. It is not only a "system resources dictionary", 


46 


because many  DD/DSs contain corporate information about 
users and procedures, that are not really system resources 
at all. 

Unfortunately, -in spite of these shortcomings, no better 
designator has yet been suggested, and the terminology 
remains confusing. Part of this problem, is due to the fact, 
that DD/DS technology has evolved very rapidly in recent 
years. Also, since DBMSs were developed before DD/DSs, there 
is a natural tendency to view DD/DS as subordinate to DBMS. 
Especially, in cases where a DD/DS-like function is part of 
the DBMS, or where it cooperates with a DBMS. Nevertheless, 
the increasing interest and improvements in DD/DS, and the 
development of DD/DS independent from DBMS, has caused it 
to evolve into a complex software tool which should be 
considered more efficient than the DBMS. 

In this thesis it is assumed better to describe the 
features and functions of a DD/DS, rather than to precisely 
define what the DD/DS really is. References to DD/DS will 
imply that dictionary and directory functions are available, 
unless specifically stated otherwise. 

The basic principles that will serve as a foundation in 
further discussion include the following: 

: Data has positive or negative value. The value of the 
data derives from the fact that the entire enterprise 
depends on its availability for the proper management 
of all other resources. Thus, it must be treated as a 


resource. The management and control of data resources 
begins with a proper definition and description of 
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data. A DD/ds is a tool for the control and management 
of data as a resource. 


TO manage data as a resource it is basic that data 
about data must be clearly specified, easily accessible 
and well controlled. These data are called metadata. 
These are data objects, that in a data processing 
environment are represented in the form of elements, 
records, files or databases. Metadata is not user data, 
but identifies, defines and describes the characteri- 
stics of the latter. It describes the data resources 
Of an organization. A DD/DS contains two types of meta- 
data: Dictionary and Directory metadata. Dictionary 
metadata describes the data, and defines their meaning 
and structure. Directory metadata describes where the 
the data is stored, and how internally represented and 
accessed. 


A collection of related metadata comprises the metadata 
database (Ref. 20]. It consists of a database that 
contains descriptive and definitional information about 
the user database. It has basically the same characte- 
ristics of a user database. To achieve the goals of 
managing data as a resource it requires proper manage- 
ment. That is, planning for the design, implementation, 
maintenance, utilization and control. This implies that 
established lines of responsibility and authority; 
formal rules and detailed procedures to guide metadata- 
related activities; common procedures for collection, 
update, and maintenance; and common procedures for 
access control to the metadata must be developed. The 
DD/DS is the basic tool for managing the metadata 
database. 


The DD/DS is divided into three categories, based 
on the scope of control exercised through metadata 
management: Active, Potentially Active, and Passive, 
[Ref. ^21]. A DD/DS is said to be active with respect 
to a program or process if that program or process is 
fully dependent on the DD/DS for its metadata. A DD/DS 
is said to be passive if it does not generate metadata 
and does not have control over where and how ai user or 
processing component obtains the required metadata. A 
potentially active DD/DS provides the capability of 
producing the metadata for a given program or process. 
A potentially active DD/DS can be extented to active 
through supportive administrative procedures. Many of 
the currently commercially available DD/DSs are of this 
type. In practice, the concept of active/passive DD/DS 
refers to interfaces that it provides to other software 
packages. A DD/DS with active interfaces can better 
serve the goals of managing data as a resource. 
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B. FEATURES OF DD/DS 

The underlying design philosophy of a DD/DS is that it 
will serve the needs of a wide variety of users in an 
organization. Six groups are identified that can/should 


share the DD/DS: 


A Data  Administrators/Database Administrators (DA/DBA), 
who use the system to control the data resource, 
iplementing standards, designing, monitoring, and 
restructuring. 


. System Analysts and Programmers, who share the DD/DS to 
facilitate system design and system implementation 
activities. 

. Operations Staff who retrieve information about jobs. 

: Data-Processing Management, who receive the high-level 
impact and summary reports about data usage from the 


DD/DS. 


: End-Users, who access the DD/DS to obtain information 
about existing data and system resources. 


: Auditors, who examine system documentation provided 
through the DD/DS. 


Each of the users will contribute metadata to the DD/DS, 
having different needs, and different logical views of the 
metadata. The necessity to support such diverse logical 
views of metadata among users requires the database approach 
in the design of a DD/DS. This, includes three steps: (1) 
identify the entities (meta-entities), (2) define them, and 
(3) identify and describe their relationships. 

Allen, et. al. (Ref. 22], delineates the logical struc- 


ture of a typical DD/DS's database. The DD/DS is represented 


by a network data model of entities, relationships, and 


49 


attributes. An entity is an object of interest about which 
information is collected. A relationship is an association 
between one or more entities. An attribute is a characteris- 
tic relating to an entity or relationship. Attributes which 
can be used for an entity or a relationship in a DD/DS are: 
type, range, length, unit of measure, usage, language names, 
repetitions, keys, defaults, and display formats. It is es- 
sential that the structural characteristics of the DD/DS be 
described in logical terms. This will provide a clearer in- 
Sight into what kind of metadata are supported by it. The 
relationships between entities of a typical DD/DS are il- 
lustrated in Fig. 16. The user entity although not shown may 
be related to nearly all the other entities. Thereby, repre- 
senting the user-subschema, user-process, user-terminal, 
user-transaction, and user-report relationships. 

A recommended classification of entities (meta-entities) 
and attributes, (Ref. 23], is also listed in Fig. 17,18. The 


entities are classified into three groups: (1) data entities 


(2) system or processing entities, and (3) environmental 
entities. The attributes into seven groups: (1) identifriaeas 
tion, (2) representation, (3) statistical, (4) relationship, 
(5) control, (6) physical storage media, and (7) user- 


defined attributes. 
The last category of user-defined entities, relation- 
ships and attributes is one of the most important features 


of the DD/DS. It is the expansibility or extensibility 
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Figure 16--Logical Structure of a Typical DD/DS [Ref. 22] 
with Expansibility Feature. 
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Figure 17--Classification of Meta-Entities (Ref. 23] 
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Figure 18--Entities/Attributes Matrix (Ref. 23] 
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feature. It is the capability, in a more dynamic approach, 
that allows a user to expand the vendor standard meta-entity 
structure, to suit specific corporate needs. Extensibility 
also, can be provided by vendors by optional sets of data 
objects. This feature becomes increasingly important as it 
evolves into a means for coordinating software development 
at each stage of the application life cycle. User-defined 
entities are sometimes subject to constraints, such as, a 
new entity must assure the relationships of a predefined 
entity. To use the extensibility feature of a DD/DS effecti- 
vely three common-sense rules are applied: 

. Maintain consistency in the design of the metadatabase. 


: Maintain a proper balance in the definition of entities 
and attributes. 


; Use clearly defined, unambiguous attributes. 

Another important feature of a DD/DS is the ability for 
entity occurences to exist in different states such as test, 
production, and historic [Ref. 24]. Test status information 
is used to document evolving systems and uncopleted changes 
to production systems. Production status occurences in the 
DD/DS are reflections of operational systems. Finally, his- 
toric status reflects superseded metadata, thus, providing 
an audit trail of changes to the DD/DS. 

A DD/DS includes one or more interfaces that allows the 


user to interact with the DD/DS, as well as interface faci- 


lities to other systems. Such interfaces include: 
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. - Command languages (maintenance, report and query, data 
structure interface commands, extensibility and status- 
related commands, security commands, processing control 
commands, and administrator commands). 

; Screen oriented interface. 


. A fixed format batch data entry facility. 


: Interface that allows user written application programs 
to access the DD/DS. 


: Support interface to Report Writer and Query Systems. 
: Edit and validation criteria. 

: Test data generation. 

: Application development. aids. 

These interfaces depend on the nature of the DD/DS. The 
contents of a DD/DS can be extremely broad, as its basic 
structure lends itself to the control of a wide variety of 
processes. 

A final feature is the DD/DS control and security. 
Control in a database environment can be defined as: 

the plan of organization and all the coordinate methods 

and measures adopted by an enterprise to safeguard its 

assets (Ref. 25]. 

Check the accuracy and reliability of the data contained 
in the database, promote operational efficiency and encoura- 
ge adherence to prescribed managerial policies. Control is a 
positive force designed to direct an operation to its 
successful completion. The DD/DS function is essential in 
controlling the consistency and use of data resource. 

The primary control issues that a DD/DS addresses are 


the following: 


II 


e Data definition control through proper documentation. 


. Data usage control through use of a proper accounting 
subschema. 


: Enforcement of standards control through facilities en- 
forcing standards. 


When an organization acquires its own DD/DS system and 
depends on it, the continued proper operation of that system 
is important. The losses which follow from failure can be 
considerable. Security and privacy in the operational level 
requires that a robust and correct system is in use and 
understood by all involved, and that adequate enforcement of 
the system exists. The security and privacy issues in a DD/ 
DS concern: 


Access to the system (prevent unauthorized access, and 
permits authorized access). 


Damage to data, software and equipment. 

Most manufacturers do supply some security and privacy 
controls as part of the operating system or the DBMS. 
Security mechanisms. can be provided by the DD/DS, but remain 


relatively unexplored today. 


C. FUNCTIONS OF DD/DS 
The typical functions performed by a commercially avail- 
able DD/DS are listed in Fig. 19. These functions are: 


DD/DS Maintenance: 
interprets and processes requests to add, change or de- 
lete contents of the DD/DS i 


Extensibility 

enables the DD/DS structure to be extended by the 
definition of additional entities, relationships, and 
attributes. 
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Figure 19--DD/DS Functions [Ref. 26) 
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. Report Processor: 
provides predefined reports, the ability to customize 
reports and user-defined reporting capability. Common 
predefined report types include: 


(1) name listings 

(2) relationship reports 
(3) detail reports 

(4) summary reports 

(5) matrix reports 


(6) graphical reports 


Query Processor: 
allows English-like queries of the DD/DS (used for low 
volume retrievals). 


° Convert Functions: 
reads application programs, libraries, and schemata and 
generates input transactions for the DD/DS Maintenance 
Function which describes the detected metadata. 


Software Interface: 

provides a formatted pathway enabling the DD/DS to 
provide metadata to other software systems and enables 
these systems to retrieve and update information in the 
DD/DS. 


Exit Facility: 
enables the  vendor-supplied routines to be extended 
(not available in all DD/DS). 
: Database Management: 
performs database management tasks. Security, integrity 
concurrency control, and internal access for the data- 
base. This function is often performed by utilizing an 
existing  DBMS, nevertheless, DBMSs do not provide all 
necessary subfunctions of this function. 
These software interfaces, and convert function  capabi- 
lities, and permit the DD/DS to be intergrated with other 
software packages. The facilities used for that purpose 


allow direct and indirect access to the DD/DS; and automati- 


cally capture the metadata used by other systems. 


- 
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D. DD/DS A TOOL IN SDLC 

The DD/DS is a software tool originally intended as a 
documentation aid for data management. The software documen- 
tation feature became the primary goal of developing  DD/DS 
in the mid 1970s. Since then the spectrum of applications 
for which DD/DSs have been used has been wider. DD/DSs have 
been found useful in many areas of computer processing and 
data resource management. 

The British Computer Society (BCS) established a study 
group, the Data Dictionary System Working Party (DDWP) in 
January 1975. Over a period of time the BCSDDSWP worked to 
produce a report on the need for ae the facilities which 
should be provided by a DD/DS; and related database design 
and operational units. They studied the currently available 
DD/DSs and the related design aids. They identified. data re- 
cording and analysis needs for the design of information 
systems. They specified requirements for aids to database 
design, maintenance and operation, and considered which of 
these requirements were automatable. The report of this stu- 
dy group was published in late 1977. This study emphasizes 
the shift to expanding the functions of a DD/DS from a soft- 
ware tool with cataloging the data in an existing database, 
into an adjunct to design the database itself. This study 
also emphasizes the use of a DD/DS throughout the complete 
specification, design and implementation stages of the SDLC. 


Particular functions which could be performed would be: 


39 


: data analysis, to determine the data structure; 


: functional analysis, to determine the way in which 
events and functions use data; 


< database or conventional file design; 
: storage structure design; 
: operational running of the application systems; 
: collection and evaluation of performance statistics; 
: database tuning, to improve performance; 
: application maintenance, and database restructuring. 
The BCSDDSWP Report further recommended that the DD/DS 
should provide two sets of facilities: one set "the con- 
ceptual data model", would record and analyze requirements 
independently of how they were to be met; the second set the 
"implementation data structure", would record design decisi- 
ons in terms of the database or file structures implemented, 
and the programs that would access them [Ref. Pa m. For both 
facilities it is necessary to record data usage, as well as, 
data structure, giving rise to four areas of information 
which can be identified. Fig. 20 depicts these four areas. 
The DD/DS should relate definitions - the implementa- 
tion data structure to the parts of the conceptual data 
model they describe. Recording the mapping documents, design 
decisions, and clarifies the decisions which have been made. 
The first stage of any Information System Development 
Life Cycle is the Planning Phase or the Perception of Need 
Phase. The purpose of this activity is to determine the fea- 
sibility, technical, and economic trade-offs for a planned 
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Figure 20--The Information in a DD/DS [Ref. 27] 
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system. It is strategic in nature, a continuous process 
throughout the active life of an Information System. This 
System may already exist in the organization, requiring some 
new kind of functionality, or may be a new one. 

The planning process begins with an initial assessment 
of the current environment, and continues with an analysis 
of current usage, and determination of future requirements. 
These activities determine the data needed; the data that is 
already available; the potential conflicts and redundancies; 
and the impact on existing systems, and the potential users. 
By their nature they require a high degree of consistency 
and coordination. The DD/DS can be used to support tHeee 
activities , as a documentation tool, as well as, a design 
aid. Information about the data usage, their relationships, 
and attributes, can assist the analyst in recording the flow 
of data across functions. Conflicting usages can be identi- 
fied and resolved, and redundant data can be removed from 
the aha eon Analysts can also use the DD/DS to predict the 
impact of a proposed change and define what actions should 
be taken to prevent unwanted side-effects. Analysts must 
consider the DD/DS as a comprehensive tool for collecting 
all the facts necessary for the clear and complete statement 
of the problem, and providing data to test the solution. 

Leong Hong and Plagman (Ref. 281, recommend the fol- 
lowing use of a DD/DS in support of the information systems 


planning and modeling process (Fig. 21). 
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Steps in Information Use of DD/DS 
Systems Planning 


| | 
| | 
| | 
l | 
| | 
Ii Definition of business Document business | 
| function functions and input/output | 
| | 
| Definition of data Document data clusters and | 
| clusters identify redundancy | 
| | 
| Definition of data Document usage and | 
i cluster usage perform analysis | 
| | 
l Definition and analysis Document transactions and | 
l of transactions perform analysis | 
| | 
| Development of a Document data structure and | 
| conceptual data model perform impact analysis | 
| | 


Figure 21--DD/DS Support for Systems Planning (Ref. 28] 
The second stage is the Requirement Definition and 
Analysis Phase. This phase describes how the "real world " 
Operates. What needs to be done to achieve the predefined 
goals. What icis Of reports are needed; what activities 
will produce these reports; and where are the sources of the 
needed information? Requirement Definition and Analysis can 
be a difficult process that requires manipulation of the 
business cns and of detailed conceptual model. The use 
of a DD/DS in that phase facilitates the documentation of 
the requirement definition and supports the analysis. 
Lefkovits, et. al. (Ref. 291] identifies four very impor- 
tant results, that derived from the use of a DD/DS in this 
phase: | 
(1) The requirements that are specified can be analyzed 


to assure that the system being defined meets the 
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objectives associated with the strategic, tactical, 
and operational management of the enterprise. The 
importance of a DD/DS is that it can be used to store 
the different views of data which are associated 
with these three levels of management. Additionally 
the data that is specified as part of the requirement 
definition can be validated against the DD/DS. Thus, 
the entire enterprise and its functions can be 
modeleded in the DD/DS. 

(2) The requirements in the DD/DS can be analyzed for 
internal consistency. This solves the problem of con- 
flicting requirement specifications. 

(3) The DD/DS containing the description of the system to 
be changed, facilitates the decision on the cost ef- 
fectiveness of the modification. 

(4) Finally, the availability of the requirements in a 
well-organized and  processible manner will tend to 
improve the quality and efficiency of the tasks that 
must be performed in the succeeding stages. 

The design phase, the third stage, is a critical phase. 
This is due to the fact that management-after another review 
of justification for the undertaking- will postpone the pro- 
ject. Normally, because only a general or overview design of 
the project is prepared. The objectives of the design phase 
are intended to find a "how-to" solution. This phase is pos- 
sibly divided into logical and physical parts, to allow 
logical aspects to be defined before any implementation con- 
Siderations might affect the more abstract considerations. 
During this phase an implementation data structure is also 
constructed. Use of DD/DS during this phase facilitates mo- 
deling of data structures and the process of database design 


The DD/DS should be used in the logical and physical de- 


sign. It stores the descriptions of the system components, 
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such as, subsystems, program modules, data structures, data 
access techniques, and data flow. It documents the logical 
and physical design, by describing the data required by the 
programs. It provides flexibility by allowing individual 
findings or decisions to be recorded in the appropriate 
places in the DD/DS. It provides reference to any item of 
data of interest to the individual. Multiple user views can 
be recorded in the DD/DS. Multiple designs can be generated 
efficiently for performance testing and simulation. Conver- 
sion of data can be verified. Further, the DD/DS should be 
used actively to assist, by generating the DBMS control 
blocks from the metadata. | 

The Implementation and Test phases deal with the col- 
lection, of data, coding of programs, testing of both data. 
and processes, and overall system debugging. . The DD/DS is 
one of the few tools that provides any degree of coordina- 
tion and control over the tasks that are performed in this 
phase. During input to the implementation model, the systems 
analyst can call on the DD/DS to perform consistency checks 
to ensure that the new data is complete and correctly 
formatted. Validation by the DD/DS can confirm proper map- 
ping between the data model and the implementation model. 
Since the DD/DS -monitored by the DA- contains information 
as to who may have access to specific data sets, records, 
segments etc., access control also is gained. Finally, the 


DD/DS in this phase is very useful when it can generate 
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metadata; including a data division for a COBOL program, or 
a schema for a DBMS, or the required metadata for any other 
software component. 

During the Operation phase, next stage, there may be a 
need to refine the system to improve its performance. The 
DD/DS can be used as a tool to support a smooth transition 
between the new and the old system; or to support a smooth 
start up of a new system. It can store instructions for the 
Operational staff in many areas: descriptions of various 
activities; instructions for actions to be taken in case of 
abnormal termination of a process; statistics about the 
operation of program in the system, etc. 

The Maintenance phase is the last phase in the SDLC. Tut 
. is the phase where reorganizations or modifications are 
taking place. Some authors argue that this stage is perhaps 
a total or partial iteration of the entire SDLC; and this is 
true. Using the DD/DS appropriately in the SDLC we can 
maintain any system under development or operation. 

Fig. 22 summarizes the SDLC functions which can be sup- 


ported using the DD/DS features. 


E. BENEFITS OF USING DATA DICTIONARY/DTRECTORY SYSTEMS 
A DD/DS impacts the management of an organization by 
improving management’s control and knowledge of the data 


resource. This control and knowledge is achieved using a 


software tool--the DD/DS. The benefits that are derived vary 
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Figure 22--The Use of a DD/DS in the SDLC 
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from one organization to another. In a database environment 
with a larger number of data elements, relationships, and 
relatively high number of changes, the benefits with a DD/DS 
will be higher. 

These benefits also, seem to increase as the sharing of 
data elements between programs increases. 

Some of the benefits using a DD/DS in an enterprise in- 
volve the following aspects: 


Enables management to enforce the data definition 
standards. 


Minimizes unwanted data redundancy. 
: Assists in securing sensitive data. 


Assists management in quickly determining impacts of 
proposed changes to a system. 


Provides better data integrity than a file management 
system or any variety of DBMS. 


Assists management in ensuring complete and accurate 
changeovers in the implementation of new systems. 


R Provides information about the creation, usage, and 
relationships of data. 


Reduces the clerical load of a DA/DBA. 


Gives a DBA control over design and use of a database 


by: 
(1) controlling and documenting formulation, meaning 
and usage of data structures; 
(2) evaluating and controlling data redundancy; and 
Cay providing accurate data definitions for programs 


Supports in the analysis of an organization’s data flow 
by providing a method to track documents which flow 
through an organization. 
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e Provides a central source of information for the 
designers and analysts to prevent data redundancy and 
incosistency. 

. Generates test data for designers. 


: Provides documentation on systems design. 


: Enforces data definition standards during program 
coding. 


Automatically generates code. 


: Improves the accuracy of finished programs by genera- 
tion of test data and checking results automatically. 


s Provides cross-referencing to assist in implementing 
approved changes to a system. 


: Implements automatically amendment to operational 
systems. 


. Provides documentation on changes to a system. 


Aids operational personnel in the creation of JCL 
parameters. 


: Determines the source of data (including invalid data). 
Aids "me auditing function. 
Interfaces to application program development tools. 
The above mentioned benefits may be considered as tan- 
gible benefits. One last benefit that  DD/DS provides does 
not have a tangible value. It is the benefit that is derived 
from a properly implemented and well managed DD/DS. The 


trustworthy information for all users in an enterprise. 
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IV. COMMERCIALLY AVAILABLE DD/DS PACKAGES 


A. OVERVIEW 


In 


this chapter most of the currently, commercially 


available DD/DSs are discussed. The intention is to reveal 


the 


differences between theory, promise, and practice. The 


major problem is a lack of a standard methodology which ties 


a DD/DS into the system life cycle. 


The available commercially DD/DS packages can be grouped 


into four categories: 


(1) 


(2) 


(3) 


(4) 


Independent DD/DS: 
a free-standing package which can be used in a non- 
DBMS environment. 


Independent DD/DS with interfaces to other DBMSs: 
a free-standing package that optionally provides 
interfaces to one or more DBMS. 


Dependent DD/DS: 

the software package is designed to co-exist with a 
particular DBMS. It is dependent on the DBMS and used 
as a front end process to the particular database or 
DBMS. 


Embedded DD/DS: 

a fully intergrated DD/DS with the DBMS. It can en- 
hance the data control and management capabilities of 
a particular system. The DD/DS function usually is 
part of the data definition function, and the meta- 
data is stored as part of the database for the DBMS. 


The principal advantages of the independent DD/DS are 


the following: 


They are self-contained, performing their functions in- 
dependently of any database(s) or DBMS. 


They can store computer data, as well as, nonmechanized 
data. 
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e There is less risk involved of commitment to a database 
management system in implementing an independent DD/DS 
than a dependent one. 


: They do not need all the data descriptions required for 
a database at one time, as the dependent one. 


: They can support one or more DBMSs through their inter- 
facing capability. 


The principal advantages of the dependent or embedded 
DD/DSs are the following: 


. They provide information as to the location of data 
within the physical database. 


: They can serve as a much more powerful control tool 
when ¡integrated with the DBMS, because the database 
designer and the users will have to enforce the DD/DS 


as a tool for documentation and control of the data. 


: They provide data validation through embedded range and 
criteria checking. 


; Technical software coordination issues between a depen- 
dent DD/DS and a DBMS are minimized. 


: The selling effort to current DBMS users is easier. 
: The development effort for the DD/DS is much easier. 
Seventeen systems were selected based on a survey of the 

literature (Ref. 30,31,32]. The criteria that were used pri- 
marily concern the following areas: 

; Type of DD/DS. 

: Language used. 

: Entity names used. 

: Expansibility features. 

: Status capability. 

: User-defined t 


: Security levels provided. 
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: Software interface packages and methods. 


Fig. 23 contains the selected areas of interest. These 


DD/DSs are designed especially for large mainframes. 


SYSTEM NAME 


DATA CATALOGUE 


2 


DATAMANAGER 


PRIDE/LOGIC 


LEXICON 


EDICT 


TIS DIRECTORY 


ADABAS DATA 
DIRECTORY 


DATA CONTROL 
SYSTEM 


VENDOR FIRST RELEASE/ | CATEGORY 
LAST RELEASE 


Synergetics 1974/1977 Independent 
Corp. 

Management 1973/5 Independent 
Systems and 

Programming 

(MSP) 


M. Bryce and 1974/- Independent 
Associates 









































ANDERSEN A. I9 2^ Independent 
and Co. 

Independent 
Infodata T or DBMS 
Systems Inc. in development|Application 
Cincom 19797 Embedded 
Systems Inc. in development 
Software AG 19787 DBMS- 
of Nortn in development|Application 
America, Inc. 
Cincom 1976/1980 DBMSZ 
Systems Inc. Application 


- 


Figure 23--Features of Commercial DD/DS Packages 
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SYSTEM NAME VENDOR FIRST RELEASE/ [CATEGORY 
LAST RELEASE 


1976/- DBMS- 
Application 


DATA CONTROL Harvey 
SYSTEM (DCS) Systems, Inc. 





DATA DICTIONARY |Sperry Univac £96175 DBMS- 
SYSTEM Application 
(DDS 1100) 


pg A DBMS- 
Application 


DATA DICTIONARY |International 
SYSTEM (DDS) Computers 
Beas (ICL) 


1979/3 DBMS- 
Application 


DATA DICTIONARY ¡Applied Data 
Research Inc. 
(ADR) 





DB/DC DATA IBM 1974/- DBMS- 
DICTIONARY Application 


19827- DBMS- 
Application 


DICTIONARY/204 Computer 
Corporation 
of America 








|Integrated Data |Cullinane 1977/- DBMS- 
Dictionary Corporation Application 
(IDD) 
Extended Data Intel Systems 1970/1980 DBMS- 
DICTIONARY Corporation Application 
( XDD) 
UCC TEN University 3707 - IDEBITG- 


Computing Application 
Center (UCC) 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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SYSTEM NAME {SOURCE ENTITY NAMES 
LANGUAGE 


DATA COBOL Element, Group, Record, Resource, 

CATALOGUE 2 Form, Task, Report,File, Module, 
System, User 

DATAMANAGER |Assembler System, Program, Module, Data- 
base, File, Group, Item, Segment, 
PCB 

PRIDE/LOGIC |COBOL System,Subsystem, Procedures, 
Programs, Modules, Files, Inputs, 
Outputs, Call arguments, Records 
Data elements 


LEXICON Assembler and|Data elements: Item, Subgroup, 
COBOL Group, Record, Entry,Database, 

File,Sensitivity 

Processing entities:Validator, 

Program, System 













EDICT Primarily Element, Database 
PLZI 

TIS Assembler System data: Component, Reserved 

DIRECTORY word, Mask, Edit, Translate table 
Schema data: Environment, File, 
Environment file, Buffer pool, 
Internal record, Physical field, 
Subschema, Access set, External 
field 
User data:User, Procedure, 
Expression, Equation 

ADABAS DATA |Assembler, Fields, Relationships, Databases, 

DIRECTORY COBOL, Files, Field verification, 

NATURAL Procedures, Owners/Users, User 

views, Programs, Modules, System, 
Reports, Responce codes 

DATA CONTROL] COBOL and User, Report, Program, Document, 

SYSTEM MANTIS System, Source, Element, Database, 

(Cincom) File, Transaction, Physical/ 
Logical Element 

DATA CONTROL | COBOL Schema, Set, Area, Subschema, 

SYSTEM (DCS) Record, Field, Program 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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COBOL 
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ENTITY NAMES 


Module, Run, Unit, DBA, Analyst, 
Application, Sshema, Subschema, 
Area, Record, Group, Data-itenm, 
Data-name, Set, Database, Stream, 
Procedures, Run, File, Record, 
Relation 


Entity, Relationship, Attribute, 
File, Virtual file, Record, Group, 
Item, Operation, Event, System, 
Program, Module, PMAP, DMAP, Area, 
Schema, Subschema, Set 


Database, Area, File, Key, Element, 
Record, System, Person, Job, Step, 
Authorization, Module, Program, 
Report 


Database, Segment, Element, PCB, 
SYSDEF, System, Job, Program, PSB, 
Module, Transaction, DDUSER 


File, Group, Record, Field, 
Procedure 


User, System, Program, Entry, Point 
Module, Element, Record, File, Task 
Queue, Map, Panel, Line, Physical 
terminal, Logical terminal, 
Destination, Message 


Application, Work unit, Program, 
File, Work area, Work structure, 
Database, Schema, Subschema, Item, 
User 


Database, Shared secontary 
index, Data set group, Index data 
fields, Segment, Index data field 
list, Lchild, Field, Program, Job, 
PSB application, Module, and 21 
more for message formatting 

and communications 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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SYSTEM NAME |EXPANSIBILITY 


DATA Define additional entities 
CATALOGUE 2 |relationships, and attri- 
butes. Additional entities 


used with IMS, TOTAL, DMS, 
ADABAS, OMS- 1100, S2000/80, 
and IDMS 


DATAMANAGER |Three additional entity 
structures supported: 
entity types, attributes, 
and relationships based 
On existing ones. Also, 
additional entity types 
for each DBMS it supports 





PRIDE/LOGIC |None. All metadata relates 
to PRIDE methodology 


LEXICON Provides capability of 
creating conventional file 
and database oriented data 
descriptions from existing 
dictionary database 
entities 


EDICT None 
TIS None 
DIRECTORY 


ADABAS DATA {Define additional entities 

DICTIONARY attributes, and relation- 
ships, via changes to the 
D/D schema ~ 








STATUS CAPABILITY 


Status codes: 
Existing, Proposed, 
Obsolete, User- 
defined, and 
version numbers 


Version numbers. 
Status facility 
allows partition- 
ing by time, status 
project etc. 


One status for 
each of nine deve- 
lopment phases, 
modification, 
improvement,active 


Status codes: 
Proposed, Approved, 
Concurrent, 
Effective 


None 


User-defined 
statuses 


No status codes or 
version numbers. 
Status capability 
via separate D/Ds 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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SYSTEM NAME |EXPANSIBILITY STATUS CAPABILITY 






Test/production 
versions for the 
system entities. 
No status facility 


Define additional entities 
attributes, and relation- 
ships, through MANTIS 


DATA CONTROL 
SYSTEM 
(Cincom) 










Define additional None 


attributes 


DATA CONTROL 
SYSTEM (DCS) 


















DDS 1100 Define additional entitiesļ|Proposed, Approved, 
relationships, and Obsolete, Active, 
attributes All, and versions 

Test, Training, All, 
Production and 
User-defined 

DATA Operational/User- 

DICTIONARY defined codes and 

(DDS) version numbers 

DATA Define additional entitiesjProduction, History 

DICTIONARY relationships and and 

(ADR) attributes version numbers 








DB/DC DATA Define additional entities|Production,lInstal- 



















DICTIONARY relationships and led and User- 
attributes defined 
DICTIONARY/ Define additional entities|None 
204 relationships and 
attributes 
Integrated Define new attributes; full]|Production, and 
Data (IDD) Expansibility planned and Historic, and 
Dictionary version numbers 
XDD DATA Define additional entities|Status codes: Test 
DICTIONARY relationships and Production,Load, 
attributes Development and 
Obsolete and 
version numbers 
UCC TEN None Test and 


Production status 
= with 256 sides 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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ON-LINE|USER-DEFINED 
SYSTEM NAME |QUERY REPORTS SECURITY LEVELS 


Customazation via 
DATA YES macro routines; ad- Three types of 
CATALOGUE 2 ditional reports via |password;j; 
call and file extrac-jentity type, 
tion capabilities. and command 
New reports erquire levels 
user software 

















Customazation via 
macro routines; ad- 
ditional reports via 
call and file extrac- 
tion capabilities. 
New reports rely on 
the user. Interface 
facility 


DATAMANAGER |YES Security 
assigned at 
three levels; 
access, alter 


and remove 


Function, and 
entity type 
levels 


PRIDE/LOGIC YES Via extraction 


facilities 












Via extraction Use of D/D spe- 






LEXICON YES facilities (LEXICON cial security 
Access Module module;Security 
assigned at 
three levels; 
user supplied 
password 
Entity type and 
EDICT YES Via the user-defined |others via user 
i language defined securi- 
ty routines 
TIS MES Via comprehensive Command level 
DIRECTORY retrieval component 
Entity, attri 
ADABAS DATA |YES Via NATURAL , a bute, attribute 
DIRECTORY program development value, and 


facility function (read 
and write 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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ON-LINE|USER-DEFINED 
REPORTS 


QUERY 


SYSTEM NAME 





DATA CONTROL | YES Through the Socrates 
SYSTEM Report Writer 
(Cincom) 


DATA CONTROL|YES via|Report options 
SYSTEM (DCS) ¡QLP 
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Figure 23--Features of Commercial DD/DS Packages 
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SECURITY LEVEL 


Password faci- 
lity; security 
at element 
level may be 
specified; ad- 
ditional secu- 
rity through 
"user exit" 


None 


Command and 
entity 
occurence 


Entity type, 
function, user, 
and operational 
status 


Entire 
occurence 


Sign on, status, 
and entity type 


Login; 
Security levels 
planned 


User view and 
record level 


Element entity 
and 


command 
Command; more 
added via se- 


curity tables 


(continued) 


DEPENDENT DBMS OR SOFTWARE INTERFACE 
SYSTEM NAME FILE ORGANIZATION PACKAGES 


DATA COBOL relative " |COBOL, "EE 
CATALOGUE 2 files 





DATAMANAGER VSAM or BDAM COBOL, PL/I, Assembler, 
ADABAS, IDMS, IMS, DL/I 
Methodology interface, 
TOTAL (DDL processor), 
52000 


PRIDE / LOG Le COBOL relative COBOL, PL/I, =e 
files ADF (design aid), 
PDS if, 90s, Deva 


LEXICON IMS, IDMS, TOTAL IMS, OS files, IDIS 
TOTAL 
EDICT Sequencial Fiie or |INQUIRE (DDL processor) 


INQUIRE DBMS 





TIS TIS-DBM DBM (DBCS), QUERY 

DIRECTORY COMPONENT (Query proces- 
sor), COMPREHENSIVE 
RETRIEVAL(Report Writer) 


COBOL, PL/I, NATURA 
ADABAS DATA ADABAS ADAWAN (DDL processor), 
DIRECTORY Assembler, ADAMINT 
(Preprocessor), ADASCRIPT 
| (query processor), ADACOM 
(report writer) 
DATA CONTROL | TOTAL TOTAL (DDL processor), 
SYSTEM COBOL 
(Cincom) 


DMLP (preprocessor), 
DDS 1100 DMS-1100 DMS-1100 (DDL proces- 
sor) 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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SYSTEM NAME 


DATA 
DICTIONARY 
(DDS) 


DATA 
DICTIONARY 
(ADR) 


DB/DC DATA 
DICTIONARY 


Integrated 
DATA (IDD) 
Dictionary 


XDD DATA 
DICTIONARY 


UCC TEN 


DEPENDENT DBMS OR 
FILE ORGANIZATION 


IDMS (ICL version) 





DATACOM/DB 


INS or DOS PL/I 


IDMS 


System 2000/80 





IMS HIDAM 
Databases 


SOFTWARE INTERFACE 
PACKAGES 


IDMS (DDL Processor) 
COBOL, 
COBOL programs 


DATACOM/DB (DDL Proces.) 
DATAREPORTER (Report 
Writer), COBOL, 
DataQuery (Query Proc. ), 
Datacom/DL (Preproces. >, 
LIBRARIAN 


COBOL, PL/I, Assembler, 
MARK IV, IMS (DDL Proc. ) 
DBDA (design aid),GIS 
(Report Writer), other 
software 


IDMS (DDL processor). 
DML Processor (preproc. ) 
CULPRIT (report writer) 
OLQ (query processor) 
IDMS-DC (TP monitor) 


System 2000/80 (DDL 
Processor), COBOL, 
COBOL PLEX (Preproc. ) 


IMS (DDL Processor and 
System generator), 

MFS (Terminal Monitor), 
COBOL, PL/I, Assembler, 
GIS (General Information 
System) 

ADF (Batch Program 
generator) 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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SYSTEM NAME 


DATA 
CATALOGUE 2 


DATAMANAGER 


PRIDE/LOGIC 


LEXICON 


EDICT 


TIS 
DIRECTORY 


ADABAS DATA 
DIRECTORY 


Figure 23--Features of Commercial DD/DS Packages 


INTERACE METHODS/OPTIONS 





File extraction/Prefix, 
Suffix, level number, 
sequence numbers,and line 
identifier, include 
comments and initial 
values. 









interface 
delete 


File extraction, 
commands/Replace, 
and insert editing 
options, include comments 
initial values, and 
condition names 


File extraction, 
interface commands/ 


File extraction generated 
by assembly language, 
COBOL, or PL/I programs, 
by means of a transaction/ 


File extraction/ 


Interface commands/ 
Examine and update user 
views, save the RDL state- 
ments in the D/D 





File extraction/Prefixes, 
alternate functions, 
sequence numbers 
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USERS/COMMENTS 


Approximately 
250 users 


Approximately 
600 users 


Approximately 
300 users 


In January 1980 
Andersen, A. and 
Co. made the 
decision to with- 
draw LEXICON from 
the market place. 
Support of the 
system ends 1985 


Approximately 
150 users 


Approximately 

10 users. 
Integrated pre- 
compiler planned 


Approximately 
1000 users 


(continued) 


SYSTEM NAME 


DATA CONTROL 


SYSTEM 
(Cincom) 


DDS 1100 


DATA 
DICTIONARY 
(DDS) 


DATA 
DICTIONARY 
(ADR) 


DB/DC DATA 
DICTIONARY 


Integrated 
Data (IDD) 
Dictionary 


XDD DATA 
DICTIONARY 


UCC TEN 












INTERFACE METHODS/OPTIONS [USERS/COMMENTS 





Approximately 
130 users. It is 
also known as 
DATA DICTIONARY 


File extraction/More 
direct interaction if used 
with other Cincom products 









Approximately 
30 users 


Interface commands/ 


Approximately 
70 users 


File extraction,dynamic/ 
Renaming capability, 
syntactic check on names, 
inclusion of comments, 
initial values and condi- 
tion names, prefix and 
suffix capability 


File extraction, interface|Approximately 
commands/includes comments|1OO users 


File extraction, interface 
commands/Prefix, suffix, 
level numbers, includes 
comments. Segment lengths 
and field start positions 
recalculated via the 
RECALCULATE-SEGMENT 
function 


Not available 


P d 


Interface commands/ Approximately 


100 users 


Approximately 
40 users 


File extraction/Sort the 
generated data descrip- 


tions, select structures 
within views, renaming 


File extraction/suffix, 
prefix, and replace 
editing options, level 
number controls, include 
comments, condition 
names, and initial values 


Approximately 
300 users 


Figure 23--Features of Commercial DD/DS Packages (continued) 
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SYSTEM NAME 


DATA 
CATALOGUE 2 


DATAMANAGER 


PRIDE/LOGIC 


LEXICON 

EDICT 

TIS DIRECTORY 
ADABAS 


DATA CONTROL 
(Cincom) 


DATA CONTROL 
(DCS) 


DDS 1100 
DDS (ICL) 


(ADR) 
DATADICTIONARY 


DB/DC DATA 
DICTIONARY 


DICTIONARY/204 


Integrated 
(IDD) 


XDD (Intel) 


UCC TEN 


| IBM 


HARDWARE REQUIREMENTS 


IBM 360, 370, 30xxXx, 43xx 
Univac 1100, Honeywell 66 series 


IBM 360, 
FACOM M series, 


370, 30XX, 43XX 
Siemens 7000 


IBM 360, 370, 30xx, 43xx, Bourroughs 
Honeywell series 60 & 6000, HP-3000, CDC 6600 
DEC 10 & 20, VAX, Univac 1100, ICL 1903, 
Cyber 175, Prime 750 


IBM 360, 370 


IBM 360, 370, 3Oxx 43x~x 


IBM 360, 370, 30xx, 43xx 


IBM 360, 370, 30xx, 22x 


dou d 
Century & 


JOxx, 43xx 


NCR Criterion 


Univac 1100 


Univac 1100 


ICL 2900 

IBM 360, 370, 30xx, 43xx 

IBM 360, 370, 30xx, 43xx 

IBM 360, 370, 30xx, 42x 

IBM 360, 370, 30xx, 43xx, 

IBM 360, 370, 30xx, 43x~x 

CDC 6000, 70, 170, Univac 1100 
IBM 360, 370, 30xx, 43xx 


Figure 23--Features of Commercial DD/DS Packages 
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B: PROBLEMS AND WEAKNESSES OF DD/DS PACKAGES 

Data Dictionary/Directory Systems have not been avail- 
able for a long time. Nevertheless, organizations began to 
use them more and more; gaining control and experience from 
their usage. The commercially available DD/DS packages are 
not presently capable of providing all functions envisioned 
for their use. This is not to say that the DD/DSs are worth- 
less, just that there is much room for improvement. 

Immediate observations that can be made concerning their 
capabilities and usage in organizations are the following: 


(1) The majority of the commercially available DD/DSs are 
oriented for use toward a particular DBMS. 


(2) There is a broad divergency of opinion as to their 
scope. On the one hand, the scope of the DD/DS may be 
quite narrow, covering only the database definitions 
of a DBMS. On the other hand, its scope may be quite 
wide, covering all the data resources of an enter- 
prise. The argument for independence of DD/DS and 
DBMS is not supported by the fact, that the available 
DD/DS packages provide the range of functions, which 
can be expected. Some organizations using early DD/DS 
have centered its use around the directory function, 
and the DD/DS has become a database definition inter- 
face. Only advanced, well managed and large data 
processing installations have aquired and use a DD/DS 
aS a documentation tool. Further, only few of these 
DD/DS installations use it as a tool for resolving 
data conflicts, and constructing clear and un- 
ambiguous data definitions. 


(3) There is a lack of generally accepted standards, for 
what constitutes a good data definition. 


(4) There is no standard definition of "data element" 


(5) It is not quite clear which important characteristics 
Of data should be recorded in the DD/DS. 


(6) There is no accepted discipline of conceptual or 
logical database design. 
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Among the problems in the use of data of DD/DSs there is 
a lack of a comprehensive methodology which ties its use 
into the SDLC tRef£.1331 The SDLC approach is considered as 
a management tool to plan and control systems development 
efforts. DD/DSs are considered as technical support tools 
associated with database management systems. On the one hand 
the DD/DS has to support and measure the progress of a pro- 
ject's development life cycle. On the other hand, the SDLC 
must specify activities in terms of the DD/DS. Evidences of 
this situation are considered in the following: 

à No standard definitions of entity types. 

: Methodologies used, such as, structured analysis, refer 

primarily to the flow of data rather than the flow of 
control and usage of a DD/DS. 
Most DD/DS are oriented toward recording data (defini- 
tions and structures), as they are in a physical 
database or COBOL file. They are not oriented toward 
the highly dynamic design process itself. 

: A few commercially available  DD/DSs, provide graphic 
representations of the relationships between the object 
identifiers, which is very important for logical design 


methodologies. 


: Only a few of the current  DD/DSs are integrated, and 
most of them act like passive DD/DSs. 


The above mentioned problems do not imply that the DD/DS 
is not of great promise in assisting the management of the 
information resource. It will be undergoing major change 
during the years to come. It will be more integrated, its 


use will be an accepted part of the systems development and 


maintenance life cycle. Its use among non-DP staff will be 
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much more extensive than at the present time. To some degree 
today, because of product deficiencies and the lack of well 
established methodologies for usage, DD/DS seem to fail in 


practice to achieve its goals. 


OF EXPECTATIONS ON THE FUTURE OF DD/DS 

The evolution of DD/DS since 1970s, like that of all 
computer software, was the result of two driving forces: The 
need of the user, and the foresight of the developer. Fach 
of them has an active role to play and each has its 
limitations. 

The need of the user today and in the future can be cha- 
racterized by broader demand for easy-to-use facilities, 
greater need for distributed processing functions, cheaper 
systems, and much more widespread use of database technolo- 


gy. Developers on the other hand, tend to see the broader 


view of a system, but occasionally they overlook important 
aspects. This is due to a lack of exposure to day-to-day 
business situations. Nevertheless, a current trend in the 


computer industry is the availability of cheaper, smaller, 
and more effective systems. The usage of DD/DS products in 
the future will be a balance of these forces. It will be 
determined by the user demand for facilities, and by user 


acceptance of the facilities fabricated by developers. 


- 


Projections on the future of  DD/DS can be classified 


into two categories:  Short-term developments, that are 
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specific and concrete, and long-term that are often trans- 
parent. Short-term projections are: 

. More powerful query and analysis capabilities. 

: More user-friendly interfaces. 


: Greater integration of DD/DS into actual life cycle 
management. 


: New structural techniques for modeling purposes. This 
will result in the ability of the DD/DS to deal with 
richer semantic constructs. 

However, the major topics for consideration is in the 
long-term development of DD/DS, and its usage in: 

: Integration and evaluation into the IRM concept. 

k Mini-and microcomputer applications; and 

. Distributed systems (networks), consisting of several 
different types of DBMSs, file managers, text editors 
that run on computers from different manufactureres. 


Inato this environment, the Data Dictionary/Directory 


System would act as a driver of the entire system. 
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V. EVALUATION/SELECTION CRITERIA FOR DD/DS ACQUISITION 


When management decides to implement a DD/DS the first 
consideration is whether to purchase a commercially avail- 
able software package or develop one in-house. This make 
versus buy decision is based on a combination of many 
factors: hardware, software, user acceptance, planning for 
data administration, cost, and many others. In summary, 
there are technical and economic factors, as well as, organ- 
ization needs. Nevertheless, selection of a DD/DS must be 
based primarily on the needs of the salen. 

To build an in-house design and implement a DD/DS 
requires three main resources: money, technical expertise, 
and time. If an organization has all three, then it would be 
possible to build a  DD/DS. Again, there is some evidence 
that inhibits the purchase of commercial DD/DS packages: 
Organizations that have non-IBM equipment have a minimum 
number of DD/DSs to choose from. Organizations with mini- 
computers and microcomputers have an even more limited 
selection. Organizations (scientific) that need a very flex- 
ible set of entity types and attributes are not supported 
by the commercially available DD/DSs. If no commercial DD/DS 
exists that will run with existing software (e.g., operating 
system or compiler), then there may be no other option than 
to build a  DD/DS. An organization considering to build its 
own DD/DS can gain in reducing subsequent operational and 
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maintenance costs; and can better match user needs than a 
generalized commercial system. 

Even though, there are some important constraint factors 
that suggest the alternative way of buying one: 

š The design and implementation of a DD/DS is a non- 
trivial task. It needs a great deal of effort, time, 
and costs a large amount of money, especially if the 
system must be continually updated. 

: Today, DD/DS is considered as a highly sophisticated 
product, providing primarily high reliability. In an in 
house developed DD/DS, it is possible that undetected 
errors might be resident for longer periods of time 
than in a commercially available system. 

: To keep track with the technology, DD/DS needs continu- 
ous enhancements, that will improve and increase its 
effectiveness in an organization. Such enhancements are 
very difficult to be done for an in-house system, and 
are better provided by outside software products. 

These reasons, and also the commercial availability of 
an increasing number of DD/DS packages suggest that the buy 
alternative should be considered as the most preferable. 

Before initiating the selection process, an organization 
must determine if a DD/DS is justified, based on an economic 
analysis of costs and benefits or savings of implementing 
the system. As stated earlier, the DD/DS is not an integral 
part of a database management system in most of the packages 
available today. As a result, the decision to buy a DD/DS 
package should to be made independently of the DBMS. It must 
be mentioned here that if an organization also needs a 


DBMS, the ideal time to aquire a DD/DS package, is betore 


selection of a DBMS. This will allow the organization to set 
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in motion a number of administrative and conceptual  adjust- 
ments that will assist and pave the way for database. 
As in any economic analysis, determining an actual dol- 


lar value for savings or benefits may be extremely difficult 


and is a subjective judgement. Benefits may be expressed in 
terms of savings, cost avoidance, improved performance, and 
hidden opportunities. Fig. 24 lists some aspects of costs 


and savings/benefits which should be considered in the 
economic analysis. Fig. 24 is not an all inclusive list; 
management should determine actual cost/benefit categories 


to be considered applicable to an organization. 


Casts BENEFITS/SAVINGS 


Acquisition System Design and 
Data Administration Development 
Staff System Maintenance 
Hardware costs Data Redundancy 
Start-up costs Database Creation 
Data collection costs Auditing Information 
Maintenance Resource 
Application System Improved Communications 
Changes 
User Training 





Figure 24--Costs and Savings/Benefits of DD/DS 
Selection of a DD/DS should be based on who will use the 
System and how it will be used, rather than what is the most 
technologically innovative system in the market.. Lefkovits, 
et al., [Ref. 341, recommend the following selection and 


evaluation process: 
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: Determine requirements; classify which requirements 
are mandatory, and which are the desirable features, 
with a corresponding point scale indicating 
importance. 


: Develop a list of features to be used in the evalua- 
tion of DD/DS. 


. Determine a mapping from requirements to features; 
multiple mappings may be possible. 


: Compare features provided by commercially available 
systems to each mapping to determine if a system qua- 
lifies for further consideration. 

: Compare those systems which qualify for the degree of 
compliance of any available desirable features, as- 
signing a point value. 

: Sum point values assigned to desirable features ot 
qualified systems to select the DD/DS which best 
meets the requirements. 

We cannot assure that this process does not include some 
risk, since subjective judgement on the part of management 
is involved. The wrong system may still be selected for many 
reasons: improper determination of requirements; usually due 
to a lack of user involvement in the selection process; 
improper qualification of features, due to technical bias of 
selection team; inconsistent evaluation of the system, due 
to different members of the selection team, as well as, a 
lack of a well-defined measurement methodology; and undue 
emphasis on features needed in the future, but not at the 
time of implementation, which could result in user dissatis- 
faction with an unnecessarily complex system. 


The system being selected and implemented, should be 


evaluated periodically to determine its performance. Often 
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the requirements and needs of an organization change, 
requiring a reevaluation of the DD/DS to determine the new 
and/or changed demands. If the DD/DS no longer meets these 
new requirements in an acceptable fashion, a new system must 


be evaluated and selected. 
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MI. CONCLUSION 


The explosive proliferation of computer usage in the 
last ten years has led organizations to increase their Data 
Processing Activities. AS more and more activities, vital to 
an organization, become automated, the actual processing of 
data and information becomes more and more strategic to that 
organization. Today, corporate management is becoming aware 
of an important asset which, until recently, has been 
virtually ignored. The asset is data. The idea of data being 
a ESOS asset is relatively new, and has developed along 
with the influence and proliferation of computers in organi- 
zations. Data must be stored, managed, protected, and 
and retrieved effectively and efficiently, as does any other 


critical organizational resource. To accomplish these tasks, 


several kinds of management software tools have been 
developed. One tool which management can utilize is the 
DD/DS. 

DD/DS is presently in an evolutionary state. Origina kiy 


started as a tool for description and documentation within a 
database, it was soon recognized that the productivity 
services, that the DD/DS could provide, were part and parcel 
of data resource control. However, DD/DS implementations 
have lagged somewhat behind that of earlier develo AA DBMS? 
The confusion regarding scope, definition and integration of 


currently available DD/DS somewhat hinders the effective 
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and widespread implementation Ot wEbDD/ZDS. Some of the 
functions which DD/DS purport to possess are still theore- 
tical in nature. Also, a lack of user education leads to 
erroneous or misdirected use of DD/DS. Nevertheless, the 
increasing interest and attention in this area has led to 
improvements in DD/DS. Recently, the data management com- 
munity has begun to recognize the DD/DS as a more general 
tool for data resource management. It pertains not only to 
database systems, but increasingly, to non-database systems 
as well, and it is further broadened by its support for job 
streams, structured systems, on-line environments, and 
Systems design. 

One area of computer processing and data resource  mana- 
gement where the DD/DS has been found to be a useful tool is 
in the SDLC. It can be used to support various activities 
Ae sughout the system development process including the 
maintenance and operation phase. Designers, analysts, and 
persons who are involved in system development and operation 
should use and rely on the DD/DS. Recent experience has 
shown that information systems developed with the aid of a 
DD/DS tend to exhibit fewer errors that need correcting, as 
well as errors which are not as serious, compared to systems 
developed in such an environment. This result, in consider- 
ation of the resources that are often required . for system 
maintenance make the DD/DS a useful tool in the System 


Development Life Cycle. 
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